Clustering E-Mails for the Swedish Social Insurance Agency - What Part of the E-Mail Thread Gives the Best Quality?
نویسندگان
چکیده
We need to analyse a large number of e-mails sent by the citizens to the customer services department of a governmental organisation based in Sweden. To carry out this analysis we clustered a large number of e-mails with the aim of automatic e-mail answering. One issue that came up was whether we should use the whole e-mail including the thread or just the original query for the clustering. In this paper we describe this investigation. Our results show that only the query and the answering part should be used, but not necessarily the whole e-mail thread. The results clearly show that the original question contains more useful information than only the answer, although a combination is even better. Using the full e-mail thread does not downgrade the result.
منابع مشابه
Increasing the Efficiency and Quality of E-mail Communication in E-government Using Language Technology
E-government includes electronic communication between citizens and governmental agencies. In the present on-going research project, we have focused on asynchronous communication that handling officers establish and maintain with citizens through the use of e-mail. In particular, we are designing and developing a language technology-based system to support communication that handling officers c...
متن کاملA Genre Analysis of Reprint Request E-mails Written by EFL and Physics Professionals
The present study aimed to analyze reprint request e-mail messages written by postgraduates (MA students) of two fields of study, namely Physics and EFL, to realize the differences and similarities between the two email types. To investigate the purpose of the study, a sample of 100 e-mail messages, 50 Physics and 50 EFL, were analyzed according to Swales’ (1990) model for reprint requests and ...
متن کاملComparing Manual Text Patterns and Machine Learning for Classification of E-Mails for Automatic Answering by a Government Agency
E-mails to government institutions as well as to large companies may contain a large proportion of queries that can be answered in a uniform way. We analysed and manually annotated 4,404 e-mails from citizens to the Swedish Social Insurance Agency, and compared two methods for detecting answerable e-mails: manually-created text patterns (rule-based) and machine learning-based methods. We found ...
متن کاملApplied Linguistics Faculty Members’ Perceptions of (Im)politeness and (In)appropriateness of L2 Learners’ E-Mail Requests
A significant amount of contribution to pragmatics research comes from cross-cultural and developmental pragmatic studies with L2 learners in focus; however, despite broad interest in such analyses, the role of lecturers has been relatively ignored. As the lectures’ perceptions/opinions of L2 learners’ e-mail requests are important, L2 learners must become familiar with their lecturers’...
متن کاملA Critical Functional Approach to Educational Discourses of Students and Professors over the Internet Context
This paper investigated the ways Iranian B.A and M.A students of English language and their professors represent themselves linguistically in their e-mails in general, and the ways they construct and negotiate power with regard to social and cultural norms in particular. It examined 84 e-mail messages students and professors exchanged in 2012-2013 academic year through Halliday`s Systemic Funct...
متن کامل